Automatic Parallelization of the AVL FIRE Benchmark for a Distributed-Memory System

Authors

  • Peter Brezany
  • Viera Sipková
  • Barbara M. Chapman
  • Robert Greimel
Abstract

Computational fluid dynamics (CFD) is a Grand Challenge discipline whose typical application areas, such as aerospace and automotive engineering, often require an enormous amount of computation. Parallel processing offers very high performance potential, but irregular problems like CFD have proven difficult to map onto parallel machines. In such codes, access patterns to major data arrays depend on run-time data, so runtime preprocessing must be applied to critical code segments. Automatic parallelization of irregular codes is therefore a challenging problem. In this paper we describe parallelization techniques we have developed for processing irregular codes that include irregularly distributed data structures. These techniques have been fully implemented within the Vienna Fortran Compilation System. We have examined the AVL FIRE benchmark solver GCCG to evaluate the influence of different kinds of data distributions on parallel-program execution time. Experiments were performed using the Tjunc dataset on the iPSC/860.


Similar resources

Bayes Networks and Fault Tree Analysis Application in Reliability Estimation (Case Study: Automatic Water Sprinkler System)

In this study, the application of Bayes networks and fault tree analysis in reliability estimation has been investigated. Fault tree analysis is one of the most widely used methods for estimating reliability. In recent years, a method called "Bayes Network" has been used, which is a dynamic method, and information about the probable failure of the system components will be updated according to...


Data Parallel Iterators for Hierarchical Grid and Tree Algorithms

The data parallel programming language construct of a “foreach” loop is proposed in the context of hierarchically nested arrays and unbalanced k-ary trees used in high performance applications. In order to perform an initial evaluation, an implementation of an automatic parallelization system for C++ programs is introduced, which consists of a preprocessor and a matching library for distributed me...


Experimental Study of Compiler Techniques for NUMA Machines

This study explores the applicability of fully automatic parallelizing techniques for parallel computers. In this study, we capitalize on a variety of traditional compiling techniques as well as new techniques developed specifically for distributed memory architectures. Combining these traditional and new techniques, we conducted experiments with several benchmark programs on the Cray T3D.


Comparison Of Automatic And Manual Techniques For Distributed Computing

Manual Parallelization, Understand the Problem and the Program Distributed memory systems require a communication network to connect May be able to be used in conjunction with some degree of automatic parallelization. To enhance the technique, the structure of a complex network can be extended. A comparison analysis of environmental factors affecting software reliability Automatic detection of ...


Computer Science Technical Report Canonic Multi-Projection: Memory Allocation for Distributed Memory Parallelization

The Polyhedral model is now the accepted technology for automatic parallelization of affine control loop programs. It has been successful in automatically generating tiled shared memory parallel programs for shared memory platforms (plus vectorization). We address the challenges arising when we move toward distributed memory parallelization, based on wavefront execution of parameterized tiles. ...



Publication date: 1995